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IN-SEED EXPRESSION OF LYSOSOMAL ENZYMES 
DESCRIPTION "~ 

Field of the invention 

The present invention relates to the production of 
5 transgenic plants able to express, in seed storage 
tissues, a lysosomal enzyme in enzymatically active form 
and in amounts appropriate to its use in enzyme 
replacement therapy . 

In particular, the present invention relates to 

10 plants and seeds containing this enzyme. 
Prior state of the art 

Lysosomal diseases represent a wide class of human 
genetic diseases determined by malfunctions of specific 
lysosomal enzymes. Lysosomes are the main degradative 

15 organelles of animal cells and are of critical importance 
in degrading macromolecules and in recycling their 
monomeric components. These membrane-delimited organelles 
exhibit acidic pH and contain a variety of nucleases, 
proteases, phosphatases and degradative enzymes for 

20 polysaccharides, mucopolysaccharides and lipids. A 
malfunction of a specific acid hydrolase determines an 
aberrant accumulation of the substrate in the lysosomes, 
causing a variety of pathologies, among which there may be 
mentioned: Tay-Sachs disease, due to a deficiency of the 

25 enzyme p-N-hexosaminidase, Mucopolysaccharidoses (MPSs) a 
group of recessive disorders due to a malfunction in the 
degradation of complex sulphurates, Anderson- Fabry 
disease, due to a deficiency of the enzyme a-galactosidase 
A causing accumulation of globotriaosylceramide mostly in 

30 renal microvascular endothelial cells, Pompe disease, due 
to a deficiency of the enzyme acid cx-glucosidase leading 
to intralysosomal accumulation of glycogen and Gaucher 
disease, due to def iciences in the enzyme 
glucocerebrosidase (acid p-glucosidase or GCB) that 

35 determines the accumulation of glycosphingolipids mainly 
in cells of monocyte-macrophage lines. 

In particular, Anderson-Fabry disease is a dominant 
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X-linked disease resulting from the malfunction of the 
enzyme a-galactosidase A (a-GalA or GLA) , which leads to 
the progressive accumulation of globotriaosylceramide (GL- 
3) and of the related glycosphingolipids causing 
5 proteinuria in young males and subsequently, with age, 
kidney failure. 

Human a-galactosidase A is a 429 amino acids (aa) 
homodimeric glycoprotein, with four putative glycosylation 
sites, whose signal peptide is represented by the 31 N- 

10 terminal residues. The cDNA sequence of GLA has been 
published by Tsuji S. et al., 1987, Eur. J. Biochem., 
165(2), 275-280. 

Pompe disease, or glycogen storage disease type II, 
is an autosomal recessive metabolic myopathy caused by a 

15 deficiency of the enzyme acid p-glucosidase (GAA) which 
leads to storage of glycogen in almost all tissues, 
specifically injuring heart and skeletal muscles. 

Acid P-glucosidase is a 952 aa glycoprotein having 
seven putative glycosylation sites, whose signal peptide 

20 is represented by the 69 N-terminal residues. cDNA 
sequence has been published by Hoefsloot L. H. et al., 
1988, EMBO JOURNAL 7 (6), 1697-1704. 

Lastly, Gaucher disease is an autosomal recessive 
disorder caused by a deficiency of glucocerebrosidase, an 

25 enzyme required for degradation at lysosomal level of 
lipids containing covalently bonded sugars (Brady et al. 
1965, J. Biol. Chem., 240: 39-43). In the absence of 
glucocerebrosidase, the insoluble compound 

glucocerebroside (glucosylceramide) accumulates in the 

30 lysosomes leading to the disease symptomatology. 

Human glucocerebrosidase is a glycoprotein having a 
molecular weight ranging from 58 to 70 kDa, with five 
putative glycosylation sites. The complete cDNA has been 
reported (Sorge et al., 1985, Proc. Natl. Acad. Sci. 82: 

35 7289-7293) . The open reading frame, having 1545 base pairs 
(bp) plus several introns, codes for a protein having 515- 
amino acid, the 19 amino acids of the peptide signal 
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Success attained by enzyme replacement therapy in 
lysosomal diseases like Gaucher disease underlined the 
demand for more effective, safe and economical production 
5 methods of lysosomal enzymes in general. 

To date, some of these enzymes have been directly 
extracted from human placenta or produced, by genetic 
engineering, in CHO (Chinese hamster ovary) cell cultures. 
Both of the abovementioned production methods, besides 
10 being very costly, entail risks for the patient. In fact, 
following the use of placenta-extracted enzymes, entailed 
viral infections have been observed, whereas following the 
use of enzymes extracted from CHO cell cultures, in the 
15% of the patients production of specific antibodies 
15 against the product was detected, whose origin could 
relate to the production system adopted. 

Only recently some lysosomal enzymes have been 
produced in genetically modified plants. 

Human lysosomal enzymes can be produced in transgenic 
20 plants in order to solve problems of safety, viral 
infections, immune reactions, production yield and cost. 

U.S. Patent 5,929,304 reports the "in-leaf" 
production of some lysosomal enzymes (human 
glucocerebrosidase and human oc-L-iduronidase) , optimally 
25 generated in tobacco. 

In fact, tobacco is particularly indicated as a model 
system for the production of high-value recombinant 
proteins. Heterologous genes can be introduced in 
totipotent cells of tobacco using nonpathogenic strains of 
30 Agrobacterium tumefaciens . The subsequent growth and 
differentiation of the transformed cells leads to the 
attainment of stable transgenic plants. Large amounts of 
leaf material from transgenic plants are yielded in 6-7 
weeks, whereas first-generation transgenic seeds, in 
35 amounts vastly greater than those of the leaf material, 
become available in other 2-3 months. 

U.S. Patent 5,929,304 describes the expression of 
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some lysosomal enzymes in tobacco plants. The enzymes are 
produced essentially in leaf by plants transformed via the 
use of vectors containing the MeGa promoter (deriving from 
the tomato HMG2 promoter) or the cauliflower mosaic virus 
5 (CaMV) 35S promoter. U.S. Patent 5,929,304 reports, among 
the promoters indicated as useful in carrying out the 
invention, besides the abovecited ones, also the rbcS 
promoter, the chlorophyll-binding a/b protein, the AdhI 
promoter and the NOS promoter. According to the teachings 

10 of this latter patent, the promoter used may be 
constitutive or inducible. In-leaf expression of 
enzymatically active human D-L-iduronidase is described in 
all the examples reported in the patent. The expression 
occurs following the transformation of tobacco plants with 

15 vectors containing, besides the sequence coding for the 
desired enzyme, the mechanically inducible MeGA promoter, 
or the constitutive 35S promoter, respectively. 

As reported in the examples, the yield of human 
glucocerebrosidase in tobacco leaves is equal to about 1.5 

20 mg of extractable protein per 1000 mg of tissue, i.e. the 
yielded enzyme is equal to the 0.15% b/w of the initial 
leaf material. This result, obtained by in-plant 
expression, has been indicated by the inventors as the 
long-sought solution for the production of animal or human 

25 lysosomal enzymes, thereby meeting the demand for said 
enzymes in replacement therapy. 

However, in the leaf protein concentration is known 
to be extremely low, and anyhow lower than the one in seed 
. Moreover, it has to be borne in mind that in the case of 

30 in leaf enzyme expression there subsists a stability 
problem, due to the fact that enzymes are proteins 
extremely sensible to denaturating factors. Said stability 
problem requires the continual and prompt harvesting of 
the leaf tissue after the mechanical induction of the 

35 promoter, the immediate freezing at -20°C of said tissue 
and the discarding of an enormous amount of material in 
the enzyme-extracting step. Moreover, the in-leaf 
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expression allows production of a very low amount of 
proteins, anyhow lower with respect to an in seed 
production of the same. Hence, it would be useful to 
provide a seed-specific system for the expression of 
5 lysosomal enzymes. 

Although the teachings of U.S Patent 5,929,304 
theoretically relate also to the expression of the enzyme 
in plant tissues other than the leaf one, it has been 
verified by the present Inventors, while attempting at 

10 expressing acid p-glucosidase in seed, that the 35S 
promoter identified in said patent as one of the 
promoters suitable for this purpose is unable to direct 
in-seed accumulation of a stable protein (Figure 12) . As 
it is subsequently reported in the comparative example, 

15 tobacco plants were transformed using an expression 
vector comprising the cDNA sequence of the human 
glucocerebrosidase gene, coding also for the native 
signal sequence of the enzyme, under the control of the 
35S promoter. Western Blot analysis under 

20 chemiluminescence demonstrated the absence of the desired 
enzyme from seed tissues. Hence, U.S patent 5,929,304, 
although declaring the described method as suitable for 
in-seed expression, does not provide the teachings 
required in order to carry out said expression to a 

25 person skilled in the art. 

In seed expression of heterologous proteins 

Overall, seed constitutes the vegetal organ most 
widely used by humans for its caloric and protein 
contributions. Storage function for the nitrogen component 

30 is carried out by specific proteins accumulated in protein 
bodies, inside compartments in endocellular membrane 
systems. In seed, protein amounts range from about the 10- 
15% b/w in cereals to about the 25-35% in Leguminosae, 
whereas in tobacco they are equal to about the 20% b/w. 

35 Hence, protein accumulation should be directed in seed 
endosperm in order to increase the production yields of in 
plant expressed proteins. 



WO 03/073839 



PCT7IT03/00120 



6 

International Patent Application WO-A-00/04146, to 
the same applicant, described the in-seed expression of 
protein lactoferrin. This result was obtained by modifying 
tobacco and rice plants with vectors containing an 
5 expression cassette. The cassette comprises a promoter, 
and a DNA sequence coding for the signal sequence of 
storage proteins highly abundant in seeds and stage- 
specifically expressed, i.e. Pconglycinine or basic S7 
globulin, and a DNA sequence coding for lactoferrin. WO-A- 
10 00/04146 details the vectors used and the promoters 
selected. 

Lactoferrin is a glycoprotein able to bind iron and 
usually expressed in human milk. This protein exhibits an 
extremely high stability, given that lactoferrin- 

15 containing solutions may be treated even for 5 min at 90°C 
without detecting significant losses in protein activity. 
Pat. Appln. W0 00/04146 advances, by way of hypothesis, 
also the expression of proteins exhibiting a generically 
enzymatic activity using the method practiced with 

20 lactoferrin. However, this possibility remains purely 
theoretical, as no data is provided supporting the 
validity of the system adopted in said Patent Application 
for the expression of functionally active enzymes, even 
less so of lysosomal enzymes. 

25 Pat. Appln. WO-A-00/04146, it being actually not 

aimed to enzyme expression, neglects several problems of 
fundamental relevance related to the stability of the 
hexogenous protein expressed. In fact, the utmost 
stability of lactoferrin and the fact that in lactoferrin 

30 the presence of non-natural glycosidic chains does not 
influence protein folding and function, definitely bars 
the use of the system disclosed in Pat. Appln. WO-A- 
00/04146 from effectively expressing enzymes exhibiting a 
correct folding and being functionally equivalent to the 

35 native enzyme. Moreover, it is known that in the case of 
the acid Oglucosidase the glycosylation of the first of 
the five glycosylation sites in the enzyme is crucial to 



WO 03/073839 



PCT/IT03/00120 



the generation of an active enzyme; said function pertains 
to the third glycosylation site in the case of □ 
galactosidase A, and in the case of acid p-glucosidase, 
the second glycosylation site is the one crucial to the 

5 generation of the mature enzyme. 

Another problem lies in that the protein instability 
could be due to structural restraints, but also be a 
consequence of its subcellular location. For the 
expression of lysosomal enzymes the fact should be taken 

10 into account that plant cells do not possess lysosomes, 
using the vacuole as a functional analogue of these acid 
vesicles for polymer degradation. In humans, the main 
pathway for transferring soluble enzymes to lysosomes 
implies the interaction with mannose-6-phosphate receptors 

15 (MPR) . Lysosomal proteins are dispatched to the 
endoplasmic reticulum (RE) by N-terminal signal peptides, 
and are glycosilated during their transfer to Golgi 
apparatus. In this organelle, the N-linked glycans of the 
enzymes destinated to the lysosomes are selectively 

20 phosphorylated to one or more mannose residues. Then, the 
mannose-6-phosphate can bind to the membrane receptors 
which ensure glycoprotein internalization. 

Plants lack the targeting system based on mannose-6- 
phosphate, do not possess MPRs and do not apppear to 

25 produce phosphorylated glycans. Since lysosomal enzymes 
function in an acidic environment (pH <4) and are unstable 
at higher (^7) pH values, the pH (5.8) of the plant 
(extracellular) apoplastic compartment makes the secretion 
of said enzymes in said compartment ideal for their 

30 stability. Although it is inferrable that human lysosomal 
glycoproteins may be secreted in the apoplastic space 
(which would be particularly suitable for the storage of 
lysosomal enzymes), it does not follow that the system 
disclosed in the abovecited Patent Application be able to 

35 vehicle lysosomal enzymes in the apoplastic space of the 
seed storage tissue. For the abovementioned reasons, this 
localization could provide stability to expressed enzymes; 
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in fact, an accurate localization of the lysosomal enzymes 
inside plant tissues is of fundamental importance for the 
stability of the produced enzyme. 

Moreover, as it is known, unlike lactoferrin, enzymes 
are in most cases extremely unstable proteins requiring 
low-temperature storage, being easily inactivated by heat 
or denaturating agents. 

In the light of these open questions, a person 
skilled in the art would hardly find and reckon in WO-A- 
00/04146 the technical support required to successfully 
express active lysosomal enzymes in storage organs of 
plant seeds. 

Object of the present invention is to solve the 
problems left unsolved by the abovementioned state of the 
art. In particular, object of the present invention is to 
produce an expression system allowing the generation of 
transgenic plants able to express in seed a lysosomal 
enzyme being stable and in an enzymatically active form. 
A further object of the present invention is to produce 
such enzyme in an amount greater than that obtainable in 
leaf, i.e. greater than 1.5 mg per gram of tissue used. 
Summary of the Invention 

The invention is based on the unexpected discovery 
that lysosomal enzymes, of animal or human origin, can be 
advantageously expressed in seed storage organs in a form 
which is stable (stability exceeds 12 months in 
appropriately stored seeds), enzymatically active and in a 
high amount suitable for a medical use of said enzymes. 

This amount is not lower than the 0.8%, preferably 
than the 1%. In an optimal form, the yield is of about 
the 1.5% of the total seed proteins, in order to use said 
seeds as storage and preservation means of such enzyme. 

Moreover, regardless from the amount of expressed 
enzyme, the effective storage of the enzyme over lenghty 
periods (>12 months) enables to plan production at more 
favourable season periods, something not viable with in- 
leaf production, which requires immediate extraction and 
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purification of the enzyme. 

Within the present description, with the term 
"enzymatically active form" it is meant that the enzyme 
is functionally active and that its activity is 
5 equivalent to that of the native enzyme. 

Hence, object of the present invention is a 
genetically transformed plant able to produce a lysosomal 
enzyme of animal or human origin, characterised in that: 

- said plant is transformed via the use of a 
10 recombinant expression vector comprising: 

a. a promoter of a plant gene specific for the 
expression in seed storage organs and stage- 
specific; 

b. a DNA sequence encoding the signal sequence 
15 of a plant protein able to dispatch said lysosomal 

enzyme to seed storage organs and to provide the 
post-translational modifications required for the 
expression of the enzyme in active form; 

c. a DNA sequence encoding said lysosomal 
20 enzyme deleted of the sequences encoding the signal 

se q uence Q f the native enzyme and the sequence 
encoding the native polyadenylation signal; 

d. and a polyadenylation signal; 

- said enzyme is expressed in seed storage tissues 
25 in enzymatically active form and in an amount of at least 

the 0.8%, preferably of the 1%, of the total proteins of 
the seed. 

Objects of the present invention are also the method 
for producing said plant by transforming plant cells with 

30 the abovedescribed vector and by regenerating plants from 
said transformed cells, the seeds produced by said plant, 
the method for producing said seeds, the method for 
purifying the enzyme from said seeds, the use of said 
seeds for the preparation of medicaments for enzyme 

35 replacement therapy and the use of said seeds as means 
for storing and preserving a lysosomal enzyme in 
enzymatically active form. 
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Detailed description of the figures 

Figure 1 shows the plasmid named pGEM-GCB obtained 
from the initial cloning of the GCB gene and used for the 
control of the complete sequence of the amplified gene. 
5 pGEM-GCB is obtained by cloning the fragment 
corresponding to the GCB gene having sequence SEQ ID NO 1 
(obtained by cloning total RNA from human placenta by RT- 
PCR (reverse transcriptase PCR) with the primers having 
SEQ ID NO 3 and 4) amplified with the primers having SEQ 
10 ID NO 4 and 5 in pGEM®-T plasmid (Promega) in the EcoRV 
linearization site, and then delimiting the GCB gene 
between SacI and Smal sites, in primers having SEQ ID NO 
5 and 4, respectively. The primer having SEQ ID NO 5 
allows the deletion of the DNA sequence portion encoding 
15 the human signal sequence from SEQ ID NO 1. 

Figure 2 shows the plasmid named pPLT2100, used to 
construct the PGLOB-GCB chimeric gene. pPLT2100 is 
obtained by inserting, between the Smal and SacI sites of 
the pUC19 vector polylinker (EMBL Acc. N. X02514) , the 
20 fragment corresponding to the GCB gene cloned in the 
plasmid of Figure 1 cleaved with the same enzymes and the 
PGLOB promoter (SEQ ID NO 6} plus the sequence coding for 
the signal sequence of basic soy globulin (SEQ ID NO 7) 
cloned between the Xbal and BamHI sites. 
25 Figure 3 shows the plasmid named pPLT4000 obtained 

by inserting between the Xbal-SacI sites of the pBHOl 
plasmid (Clontech) the PGLOB-GCB cassette cleaved with 
the same enzymes (Xbal-SacI) from the vector of Figure 2. 
Figure 4 shows the plasmid named pPLT4100 obtained 
30 by inserting between the BamHI-SacI (blunt) sites of the 
pBHOl plasmid (Clontech) the PGLOB-GLA cassette 
comprising the PGLOB promoter (SEQ ID NO 6), the signal 
sequence of the basic 7S soy globulin (SEQ ID NO 7) and 
the cDNA of the human □ galactosidase A gene (SEQ ID NO 
35 8), indicated as GLA, deleted of the nucleotides coding 
for the signal peptide (i.e., deleted up to nucleotide 
116) . The PGLOB-GLA cassette was obtained by constructing 
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plasmids analogous to those shown in figures 1 and 2 in 
which the cDNA cloned is that of human ct-galactosidase A 
(SEQ ID NO 8) deleted of the nucleotides encoding the 
signal peptide using the primers having sequences SEQ ID 
5 NO 10 and SEQ ID NO 11, which add a BamHI restriction 
site at 5' , and the ECORV (blunt) restriction site at 3' 
of the amplified DNA, respectively. 

Figure 5 shows the plasmid named pPLT4200 obtained 
by inserting between the Smal-SacI sites (Sad is cleaved 
10 with the blunt EcoCRI isoschizometer) of the pBHOl 
plasmid (Clontech) the PGLOB-GAA cassette comprising the 
PGLOB promoter (SEQ ID NO 6), the signal sequence of the 
basic 7S soy globulin (SEQ ID NO 7) and the cDNA of the 
human acid a-glucosidase gene (SEQ ID NO 12) indicated as 
15 GAA deleted of the nucleotides encoding the signal 
sequence (i.e. deleted up to nucleotide 426). The PGLOB- 
GAA cassette was obtained by constructing plasmids 
analogous to those shown in figures 1 and 2 in which the 
cloned DNA is that of human acid a-glucosidase (SEQ ID NO 
20 12) , deleted of the nucleotides coding for the signal 
peptide using the primers having SEQ ID 14 and SEQ ID 15, 
which add the EcoRV (blunt) restriction site at 5' and at 
3' of the amplified DNA. 

Figure 6 reproduces the result of an agarose gel 
25 electrophoresis of the DNA from PCR amplification carried 
out with primers specific for the human 
glucocerebrosidase gene (SEQ ID NO 4 and 5), on DNA 
extracted from plants transformed with the construct of 
Figure 3. The amplified fragment of about 1500 base pairs 
30 (bp) corresponds to the human glucocerebrosidase gene 
transformed in tobacco and present in the genome of 
nearly all the kanamycin-resistant tobacco lines 
obtained. The molecular weight marker ladder: 1Kb ladder 
(Promega) (bands from bottom to top lOOObp, 1500bp f 
35 2000bp, 3000bp, 4000bp, 5000bp, 6000bp and 7000bp) was 
loaded in the well of lane S. The product of DNA 
amplification of nine transformed tobacco plants was 
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loaded in the wells of lanes 1 to 9. The product of the 
amplification carried out on the DNA of wild tobacco 
plants, not containing a homologous sequence amplifiable 
with the same primers, was loaded in the well of the C 
5 lane. 

Figure 7 reports the results of Northern analysis 
performed on RNA extracted from immature seeds (Days 
After Pollination, DAP, 20) of tobacco lines, transformed 
with the plasmid of Figure 3 and tested positive at PCR 

10 control for gene presence detection, reported in Figure 
6. Total RNA extracted with the Trizol® method was loaded 
(10*Vg/well) on agarose gel, subjected to 
electrophoresis, transferred on a nylon membrane and 
hybridised with a radioactive probe obtained by 

15 amplifying a human glucocerebrosidase gene fragment. 
Total RNA extracted from transgenic plants was loaded in 
lanes 1 to 8; C- is the negative control consisting of 
total RNA of a wild-type tobacco plant, C+ is the 
positive control consisting of total RNA (15Dg) extracted 

20 from human placenta. The dimensions of the underlined 
band correspond to those expected for the messenger RNA 
(mRNA) of the GCB gene. The gene is not transcribed in 
plants 5, 6 and 8. 

Figure 8 reports a SDS-PAGE gel of proteins 

25 partially purified from seeds of transgenic tobacco 
plants transformed with the plasmid of Figure 3 (lanes 1 
to 8) reported in Figures 6 and 7, and their Western 
analysis. Western analysis was carried out on the total 
raw protein extract (1 mg) of mature seed, separated by 

30 SDS-PAGE electrophoresis. The analysis was carried out 
with chemiluminescence techniques, using a rabbit 
polyclonal antibody specific for human glucocerebrosidase 
enzyme as primary antibody, and a peroxidase-conjugated 
anti-rabbit IgG as secondary antibody. C- indicates the 
35 negative control, consisting of proteins extracted from 
wild-type tobacco seed; C+ indicates the positive 
control, consisting of commercial human 
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glucocerebrosidase enzyme (50ng) . 

Figure 9 reports the expression profile of the PGLOB 
promoter in the tobacco seed of a transgenic line, 
regulating the expression of the GCB gene in the more 

5 productive lines. The different lanes contain total 
protein extracts obtained from the seed of a same line 
harvested at subsequent times after fertilization {DAP: 
day after pollination) . Western Blot analysis, following 
the same procedure indicated for Figure 8, ascertained 

10 the presence of the protein corresponding to the human 
glucocerebrosidase enzyme. Lane I: DAP 4; lane II: DAP 9; 
lane III: DAP 14; lane IV: DAP 18; lane V: DAP 22; lane 
VI: complete maturation; lane C+: positive control, 
commercial product 50 ng; lane C-: negative control, 

15 total protein extract from wild-type tobacco plants. 

The results shown in this Figure highlight that in 
tobacco the soy PGLOB promoter has an activation profile 
which is optimal for the production of recombinant 
proteins, it being activated at about DAP 10 and 

20 expressed until maturation. 

Figure 10 reports the results of the deglycosilation 
enzyme treatment of the glucocerebrosidase protein 
purified from tobacco seed. Lane I: deglycosilated 
commercial human glucocerebrosidase enzyme, lane 2: 

25 control with proteins extracted from wild-type tobacco 
seed; lane 3: enzyme purified from seed and 
deglycosilated by deglycosilation enzyme treatment; lane 
4: enzyme purified from seeds and not subjected to 
deglycosilation . 

30 Figure 11 reproduces the result of an agarose gel 

electrophoresis of the DNA resulting from PCR 
amplification, carried out with primers specific for the 
human ct-galactosidase A gene (SEQ ID NO 10 and 11), on 
DNA extracted from plants transformed with the construct 

35 of Figure 4. The amplified fragment of about 1200 bp 
corresponds to the human a-galactosidase A gene 
transformed in tobacco and present in the genome of 



WO 03/073839 PCT/TT03/00120 

14 

nearly all the kanamycin-resistant tobacco lines 
obtained. The molecular weight marker ladder: 1Kb ladder 
(Promega) (bands bottom to top lOOObp, 1500bp, 2000bp, 
3000bp and 4000bp) was loaded in the well of lane S. The 
5 product from DNA amplification of nine transformed 
tobacco plants was loaded in the wells of lanes lane 2- 
11. The product of the amplification carried out on the 
plasmid used for the transformation was loaded in the 
well of lane C. 

10 Figure 12. Western Blot related to the in-seed 

expression control of two tobacco lines transformed with 
GCB under control of the 35S promoter, and to the DNA 
sequence coding for the signal sequence of the human GCB 
gene. Western analysis was carried out with 

15 chemiluminescence techniques, using a rabbit polyclonal 
antibody specific for human glucocerebrosidase enzyme as 
primary antibody, and a peroxidase-conjugated anti-rabbit 
IgG as secondary antibody. 

Lane 1: human glucocerebrosidase (50 ng) , purified 

20 from placenta and deglycosilated; lane 2: human 
glucocerebrosidase (50 ng) , non-deglycosilated and 
purified from placenta; lane 3: total proteins (1 mg) 
extracted from SR-S9 line seed; lane 4: total proteins (1 
mg) extracted from SR-S10 line seed; lane 5: total 

25 proteins (1 mg) extracted from SR-S11 line seed; lane 6: 
total proteins (1 mg) extracted from SR-S12 line seed; 
lane 7: total proteins (1 mg) extracted from SR-S13 line 
seed; lane 8: total proteins (1 mg) extracted from SR-S14 
line seed. 

30 Detailed description of the Invention 

The lysosomal enzymes according to the present 
invention include but are not limited to: ct-N- 
acetylgalactosaminidase, acid lipase, aryl sulfatase A, 
aspartylglycosaminidase, ceramidase, a-f ucosidase, a- 

35 galactosidase A, a-galactosidase, galactosylceramidase, 
glucocerebrosidase, a-glucosidase, p-glucuronidase, 

heparin N-sulfatase, p-hexosaminidase, iduronate 
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sulf atase, a-L-iduronidase, a-mannosidase, -mannosidase, 
sialidase and sphingomyelinase. 

A person skilled in the art will know said enzymes, 
as well as their nucleotide and polypeptide sequences. The 
5 sequence encoding said enzymes might be isolated in 
accordance with any one known method. A preferred method 
for isolating said sequences is based on the RT-PCR 
technique. Suitable primers (including or not including 
the DNA sequence encoding the native signal sequence of 
10 the protein and the polyadenylation signal) may be 
designed from known sequences, and the cDNA of the desired 
gene may be cloned by RT-PCR from mRNA extracted from 
placental cells. 

The cloned sequences including the DNA encoding the 
15 native signal sequence and/or the polyadenylation signal 
may subsequently be modified, by enzymatic treatments or 
by further PCR amplification using suitable primers, so as 
to delete said sequences. 

In a preferred embodiment of the present invention, 
20 the lysosomal enzyme is human acid p-glucosidase 
(glucocerebrosidase or GCB) . The cDNA of said enzyme (SEQ 
ID NO 1) may be cloned from mRNA extracted from placental 
cells using the primers having SEQ ID NO 3 and 4. Said 
cDNA, containing the sequence coding for the GCB native 
25 signal sequence, may further be PCR-amplif ied using the 
primers having SEQ ID NO 4 and 5, which delete said 
sequence, i.e. nucleotides 1-57 of SEQ ID NO 1, and 
adding restriction sites suitable for an easier insertion 
of the amplified product in the recombinant expression 
30 vector. The polyadenylation sequence is already absent 
from SEQ ID NO 1. 

In another preferred embodiment of the present 
invention, the lysosomal enzyme is human a-galactosidase 
A (a-GalA or GLA) . The cDNA of said enzyme (SEQ ID NO 8) 
35 already deleted of the portion encoding the signal 
peptide (i.e. to nucleotide 116), may be cloned from mRNA 
extracted from placental cells using the primers having 
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SEQ ID NO 10 and 11. Said primers add restriction sites 
suitable for an easier insertion of the amplified product 
in the recombinant expression vector. The polyadenylation 
sequence is already absent from SEQ ID NO 8. 
5 In yet another preferred embodiment of the present 

invention the lysosomal enzyme is human acid <x- 
glucosidase (GAA) . The cDNA of said enzyme (SEQ ID NO 12) 
already deleted of the portion encoding the signal 
peptide (i.e., to nucleotide 426), may be cloned from 

10 rnRNA extracted from placental cells using the primers 
having SEQ ID NO 14 and 15. Said primers add restriction 
sites suitable for an easier insertion of the amplified 
product in the recombinant expression vector. The 
polyadenylation sequence is already absent in SEQ ID NO 

15 12. 

The recombinant expression vector for plant 
transformation of the present invention may be any one 
known vector suitable for the transformation of plant 
cells and for the expression of protein products in them. 

20 Said vector may be cleaved at the most appropriate 
restriction sites, and in it, an expression cassette 
according to the present invention may be inserted. 

Suitable plant transformation vectors according to 
the present invention include, but are not limited to, 

25 Agrobacterium Ti plasmids and derivatives thereof, 
including both integrative and binary vectors, plasmids 
pBIB-KAN, pGA471, pEND4K, pGV3850, pMON505 and pBHOl. 
Included among the vectors of the present invention are 
also DNA or RNA plant viruses like, e.g., cauliflower 

30 mosaic virus, tobacco mosaic virus and their engineered 
derivatives genetically suitable for the expression of 
lysosomal enzymes. Moreover, transposable elements may be 
used in conjunction with any vector to transfer the 
expression cassette of the present invention into the 

35 plant cell. 

Preferred vector for the purposes of the present 
invention is the expression plasmid pBHOl in which an 
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expression cassette is inserted. 

The expression cassette according to the present 
invention comprises a promoter of a plant gene, stage- 
specific and specific for the expression in seed storage 
5 organs. As stage-specific, it is meant a promoter 
inducing the expression of the gene it controls at a 
specific seed development stage. 

A promoter suitable for practicing of the present 
invention is the promoter of the basic 7S soy globulin 
10 (SEQ ID NO 6) . The expression cassette according to the 
present invention further comprises a DNA sequence coding 
for the signal sequence of a plant protein able to 
dispatch said lysosomal enzyme to the seed storage organs 
and to ensure the post-translational modifications 
15 required for the expression of the enzyme in 
enzymatically active form, 

A signal sequence suitable for practicing the 
present invention is the signal sequence of basic 7S soy 
globulin (SEQ ID NO 7) . 
20 The expression cassette according to the present 

invention also contains a DNA sequence encoding a 
lysosomal enzyme of the ones listed above, deleted of the 
sequences encoding the native signal sequence and the 
polyadenylation signal . 
25 Moreover, the expression cassette of the present 

invention contains a polyadenylation signal (or that 
already present in the expression vector can be used) . 

The elements constituting the expression cassette of 
the present invention should be functionally linked, in 
30 the aboveindicated order in the 5'-> 3' direction. 

Optionally, the nucleotide sequence encoding the 
enzyme may be preceded by a short sequence apt to ease 
purification of the same protein on affinity columns, 
like e.g., his 6x tag, FLAG® (Sigma) or GST (Amersham) . 
35 in a preferred embodiment of the present invention, 

the vector derives from pBHOl plasmid (Clontech) in 
which the abovedescribed expression cassette is inserted. 
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Said cassette comprises, besides the DNA encoding the 
desired lysosomal enzyme, the promoter of the basic 7S 
soy globulin gene (PGLOB) (SEQ ID NO 6) and the DNA 
coding for the signal sequence of the gene of the basic 
5 7S soy globulin (SEQ ID NO 7). Said expression cassette, 
inserted in pBHOl vector (Clontech) upstream of the 
polyadenylation signal already present in the vector, 
contains the PGLOB promoter and the DNA sequence encoding 
the signal sequence of the basic 7S globulin fused to the 

10 cDNA of the human lysosomal enzyme (e.g., GCB SEQ ID NO 
1, a-GalA SEQ ID NO 8 and GAA SEQ ID NO 12) deleted of 
nucleotides coding for the signal sequence of the native 
enzyme and of any nucleotide not coding the mature 
enzyme; i.e., in the case of SEQ ID NO 1 nucleotides 1- 

15 57, in the case of SEQ ID NO 8 nucleotides 1-116 and, in 
the case of SEQ ID NO 12, nucleotides 1-426. 

Plants suitable for practicing the present invention 
are those with seeds exhibiting a high protein content, 
among which Leguminosae, cereals and tobacco are 

20 particularly suitable. 

In fact, as indicated above, protein content in 
seeds is of about the 25-35% in Leguminosae, of about the 
10-15% in cereals and of about the 20% in tobacco. 

In the preferred embodiment of the present invention 

25 tobacco plant was used. 

The transformation of the plant cells from which the 
plant according to the present invention is regenerated 
may be carried out by any one technique known to a person 
skilled in the art, like Agrobacteriu/n-mediated 

30 transformation of leaf discs or of other plant tissues, 
microinjection of DNA directly into plant cells, 
electroporation of DNA into plant cell protoplasts, 
liposome or spheroplast fusion, micropro jectile 
bombardment and the transfection of plant cells or 

35 tissues with appropriately engineered plant viruses. 

The invention can be practiced by transforming or 
transfecting plant cells with recombinant expression 
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vectors containing the expression cassette according to 
the present invention and selecting the transf ormants or 
transf ectants expressing the enzyme. The transf ormants 
may be selected by selection markers commonly known in 
5 the state of the art. Plant transf ormants are selected 
and induced to regenerate fertile whole plants able to 
form seeds expressing the lysosomal gene in enzymatically 
active form following agritechnical methods known in the 
specific field. In the preferred embodiment of the 

10 present invention, the transformation of the plant cells 
from which the plant is regenerated is carried out with 
AgroJbacteriu/n cells made competent by electroporation . 
The strain with the vector is used to transform leaf 
discs (LD) . Formation first of shoots and then of roots 

15 was induced from calluses formed on LD in the presence of 
selective antibiotic. The plant genetically transformed 
according to the present invention is a stable transgenic 
plant whose genetic information, inserted subsequently to 
the tranforming, is present and expressed (without gene 

20 silencing phenomena) in the subsequent generations. 

The method for extracting the enzyme produced in 
seed according to the present invention may be any one 
standard method for extracting protein from plant tissues 
known to a person skilled in the art. Said method 

25 provides seed grinding in liquid nitrogen in a suitable 
buffer, centrifuging, supernatant recovering, filtering 
and further enzyme purification by normal or affinity 
chromatography . 

In a preferred embodiment, the extraction buffer 

30 used consists of: sucrose, ascorbic acid, Cys-HCl, Tris- 
HC1, pH6 EDTA; centrifuging takes place at 2-10°C, 
preferably at 4°C, at 14.000 rpm for 5-60 min, preferably 
for 30 min, filter porosity is of about 0.2 Jim and the 
HPLC chromatography column is any column at a weak 

35 cationic exchange suitable for the purpose. Preferably, 
lane Resource Q column (Pharmacia), at a weak cationic 
exchange with elution in phosphate buffer pH 6 and NaCl 
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gradient 20-100%, is used. For the affinity column, the 
specific antibodies are bound to affigel-type resins 
(Biorad) and the elution is carried out with ethylene 
glycol . 

5 The enzyme extracted from the seed may be used for 

the preparation of medicaments suitable for enzyme 
replacement therapy. Therefore, preferably said enzyme 
will exhibit a high concentration in seed, i.e. between 
the 0.8 and the 1.5%, preferably between the 0.8 and the 

10 1%. 

Optionally, since the in-plant produced enzyme could 
contain N-linked glycans different from those present in a 
human product, said enzyme could be modified post- 
extraction in order to avoid any immunogenic reactions in 

15 a patient requiring a regular infusion of the 
glycoprotein. In vitro modifications could remove the 
xylose residue normally bound to the mannose of plant- 
produced glycans, according to techniques already used on 
proteins produced with a CHO cell-based system. 

20 The following examples are meant to provide a more 

detailed description of the invention without however 
limiting to them the object that is being claimed. 
EXAMPLES 
Example 1 : 

25 Construction of the vector for the expression of human 
glucocerebrosidase in plant seeds . 

The GCB gene coding for human glucocerebrosidase was 
cloned, by RT-PCR technique, from mRNA purified from 
placenta with primers having SEQ ID NO 3 and 4. The gene 

30 was isolated, amplifying the DNA having SEQ ID NO 1 with 
the primers having SEQ ID NO 4 and 5, in its structural 
portion deleted of the signal peptide (i.e., deleted of 
the signal sequence of the human gene, namely the 
nucleotides 1-57 of SEQ ID NO 1) and of the poly-A site 

35 (which is not amplified by said primers) and cloned in 
pGEM®-T (Promega) to form the plasmid named pGEM-GCB 
(Figure 1) . The primers designed for amplification add 
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the Smal restriction site at 5' and the Sad restriction 
site at 3' - The sequence of the natural gene obtained, 
which at sequence control tested identical to the 
published one (Sorge et al., 1985, Proc. Natl. Acad. Sci. 

5 82: 7289-7293), was cloned in the vector named pPLT2100, 
in the Smal and SacI sites, under control of the PGLOB 
promoter (Figure 2) to form the plasmid named pPLT4000 
(Figure 3) . After accurate control by restriction, the 
resulting pPLT4000 plasmid was used for genetic 

0 transformation of plants. 
Example 2 : 

Genetic transformation of plants with pPLT4000 plasmid 
and general results 

pPLT4000 plasmid was transferred in A. tumefaciens 
5 strain EHA105 cells, made competent, by electroporation . 
The strain with the plasmid was used to transform about 
300 leaf discs of tobacco cultivar (cv) Xanthi. From 
calluses generated onto leaf discs in the presence of 
kanamycin first shoot and then root initiation was 
0 induced. Rooted plantlets were potted, and at least 150 
kanamycin-resistant plants were analysed. 

Plants T 0 were tested by PCR (Figure 6) and plants 
Ti by Northern (Figure 7) and Western (Figure 8) 
analyses . 

5 All plants with GCB gene under control of PGLOB 

promoter led to accumulation of a protein, recognised by 
GCB-specific antibodies, having a molecular weight equal 
to 58 kDa corresponding to the glycosilated human protein 
(Figure 8) . The presence of the recombinant protein 

0 exclusively in the seed and not in the leaves was assayed 
in all the examined transgenic plants. 

Recombinant GCB protein isolated from seed and 
purified with HPLC techniques showed to be identical to 
the natural protein with concern to the molecular weight. 

5 Treatment with deglycosilation enzymes confirms the 
presence of posttranslational modifications in all alike, 
at present at least in quantitative terms, those present 
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in native glucocerebrosidase (Figure 10) . 
EXAMPLE 3: 

Agrrobacterium Tumefacl ens -media ted tobacco transformation 

Day 1: a small amount of Agrobacterium tumefaciens 
5 of strain EHA 105, taken from a Petri plate culture with 
a sterile loop so as not to exceed in the amount, thereby 
avoiding subsequent problems in controlling bacterial 
proliferation on plated leaf discs, was inoculated in 2 
ml of sterile LB. Then, from a healthy tobacco plant cv 

10 Xanthi a leaf showing no alteration whatsoever, 
conversely exhibiting optimal turgor conditions, was 
taken. The leaf was briefly rinsed with bidistilled water 
to remove surface impurities, immersed for 8 min in a 20% 
sodium hypochlorite and 0.1% SDS solution and left to dry 

15 under a vertical flow hood. From then on, all steps were 
carried out under hood. In particular, the leaf was 
immersed in 95% ethanol and shaken in order to drench its 
two pages (letting the petiole emerge) for 30-40 sec. The 
leaf was then allowed to dry out completely. 

20 Discs were obtained from the entire leaf surface 

with an ethanol-sterilised punch, let fall on plates with 
antibiotic-free MS10; in particular, the ratio of 30 
discs per plate was not exceeded. Next, 2 ml LB + (just 
inoculated) Agrobacterium were poured on plate, and the 

25 bacterial suspension was evenly spread over the entire 
plate with a gentle rotary motion, in order to obtain an 
homogeneous bacterial distribution among the discs. LB in 
excess was carefully aspirated with a pipette. At all 
times in the course of those steps a parallel negative 

30 control was performed by means of a plate to which 
nothing, or only LB was added. 

Then plates were incubated at 28 °C for 24-48 hours 
under constant lighting conditions, and bacterial growth 
was indicated by the appearance of a thin opaque layer 

35 spreading over the entire plate. 
Day 2 

Leaf discs (LD) were carefully transferred on a 
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plate with MS10 + 500 mg/1 cephotaxime / and incubated at 
28°C for 6 days under constant lighting conditions. This 
step determines Agrobac ter ium inactivation . 
Day 8 

LD were then carefully transferred on a plate with 
MS10 + 500 mg/1 cephotaxime and 200 mg/1 kanamycin, and 
incubated at 28°C for 14 days, under constant lighting 
conditions. This step determined a selection of the 
transformed plants: in fact, kanamycin resistance gene 
was carried by the plasmid inserted into Agrobac ter ium. 

Day 22 

LD, that in the meantime had grown developing a 
callus, were carefully transferred on a plate with MS10 + 
500 mg/1 cephotaxime 500 mg/1, 200 mg/1 kanamycin and 500 
mg/1 carbenicillin, and incubated for 6 days. This step 
determines elimination of the Agrobacteria possibly 
survived to previous antibiotic treatments (a very 
frequent occurrence) . 

Day 28 

LD were transferred again on MS10+500 mg/1 
cephotaxime and 200 mg/1 kanamycin, and incubated until 
shoot appearance. Then, shoots exhibiting at least two 
leaves were separated from the callous mass and 
transferred in the radication medium: MSO + 500 mg/1 
cephotaxime and 200 mg/1 kanamycin. 

At root appearance, the seedlings were extracted 
from the plate, freed from agar residues, gently rinsed 
with running water and planted out in loam and sand (2:1) 
inside small plastic pots. Soil was previously saturated 
with water and then pots were covered with transparent 
plastic lids to preserve high humidity conditions, and 
placed in a growth chamber at 25°C, with a daily 16-hour 
lighting period. 

The presence of the construct containing the DNA 
coding for the GCB was assayed by PCR in plants T 0 
(Figure 6) and, subsequently, in plants Ti-T 5 . The 
presence of the transcript was assayed by Northern 
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blotting in plants Ti (Figure 7) as well as the presence 
of the protein by Western blotting (Figure 8) and, 
subsequently, in plants T 2 to T 6 . The presence of the 
construct in generations 0-5 and of the transcript as 
5 well as of the protein in generations 1-6 demonstrates 
the stability of the transformation effected, and the 
absence of gene silencing phenomena in all examined 
generations subsequent to the first one. 
EXAMPLE 4: 

10 Purification of glucocerebrosidase protein from different 
tissues of the plant and assessment of molecular weight. 

Extraction of all the tobacco seed proteins was 
performed grinding the seeds in liquid nitrogen in the 
presence of an extraction buffer (0.5 M sucrose, 0.1% 
15 ascorbic acid, 0.1% Cys-HCl, 0.01 M Tris-HCl, 0.05M EDTA 
pH 6) . 

Then the resulting solution was centrifuged at 
14.000 rpm for 30 min at 4°C and the supernatant with the 
soluble proteins was recovered. 
20 The solution was filtered with filters of 0.2 |im 

porosity, and the glucocerebrosidase partially purified 
by removing proteins of a <30 KDa molecular weight by 
centrifugation in Centricon 30 column (Amicon) • 

The glucocerebrosidase was further purified by HPLC 
25 chromatography on Resource Q column (Pharmacia) at a weak 
cationic exchange, with elution in phosphate buffer pH 6 
and NaCl gradient 20-100%. The peak corresponding to 
glucocerebrosidase eluted at 0.6 M NaCl. 

The elution fractions were reunited and filtered in 
30 Centricon 30 to remove salt. 

For the glucocerebrosidase extraction from tobacco 
seeds, up to the centrifugation step the Inventors 
proceeded as in the case of extraction from seed, then 
the supernatant was additioned with 60% (NH 4 ) 2 S0 4 and left 
35 shaking in ice for 60 min. 

Then the solution was centrifuged at 14.000 rpm for 
15 min at 4°C, the pellet recovered and then resuspended 
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in phosphate buffer pH 6,8. 

For the assessment of molecular weight in SDS-PAGE, 
the staining agent (SDS loading buffer) was additioned to 
the glucocerebrosidase sample (20^1) and the samples were 
loaded onto 10% polyacrylamide minigels. Running 
conditions were: initially 10mA, and 20mA for the entire 
run, in Tris-glycine lx buffer. Then the gel was stained 
by Silver staining technique and the molecular weight 
assessed referring to molecular weight standards. 
EXAMPLE 5: 

Western analysis of the glucocerebrosidase protein 
produced in plant and deglycosilation thereof. 

Glucocerebrosidase purified from seed according to 
example 5, after electrophoretic separation on acrylamide 
gel, was transferred by electroblotting (buffer 25mM 
Tris, 192 mM glycine, 20% methanol, 45 V at 4°C) onto a 
nitrocellulose membrane (BA85 Schleicher and Schull) . 

The membrane with the immobilised protein was shaken 
for 60 min in TBS-T 5% Skim milk solution and then, after 
at least three rinsings with TBS-T, the membrane was 
incubated for 60' at room temperature, shaken with TBS-T 
5% Skim milk and primary antibody (rabbit polyclonal 
antibody specific for human GCB) in a 1:2500 ratio. 

After reaction with the primary antibody, the 
membrane was rinsed at least 3 times with TBS-T and then 
incubated, shaken for 60' with the secondary antibody 
{Peroxidase-conjugated anti-rabbit IgG) , always in TBS-T 
5% Skim milk solution, in a 1:12.000 ratio. 

After reaction with the secondary antibody, the 
membrane was rinsed at least 3 times with TBS-T and 
placed in contact with Amersham's chemiluminescence kit 
ECL™ solutions. 

Then the membrane was exposed in contact with a 
photoplate (Hyperfilm™ MP, Amersham) in darkroom for 
variable lengths of time (Figure 8) . 

Deglycosilation with N-glycosidase A and F enzymes 
(Calbiochem, Boehringer Mannheim) was carried out using 
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10 nl b/v of glycopeptide (10 \xg) denatured, in 0.1% SDS 
brought to boiling point for 2 min. 

90 |il of buffer (20 mM phosphate buffer pH 7.2, 50 
mM EDTA pH 8, 10 mM sodium azide, 0.5% NP40, 1% p- 

5 mercaptoethanol) were additioned to this solution, which 
was brought to boiling point again for 2 min, then cooled 
at 37°C. To the resulting 100 |il, 1 U of N glycosidase F 
and 1 U of N glycosilase A were additioned, and let 
incubate at 37 °C for 18 hours. Then the reaction product 

10 was analysed on SDS-PAGE gel and the glucocerebrosidase 
protein detected by Western technique (Figure 10) . 
Example 6 : 

Assessment of enzymatic activity of the protein produced 
and accumulated in seed. 

15 The enzymatic assay was carried out in 200 \il of an 

100 mM buffer K phosphate, pH 5, 0.15% triton x-100, 
0.125% Na taurocolate, 4-MUG (4 methyl-umbellif eryl 
glucopyranoside) solution, at 37°C for 60' • 

The reaction was terminated by addition of 1 ml of a 

20 0.1M glycine solution pH 10 and the reading was carried 
out at the spectrophotometer with 340 and 448 nm 
wavelengths. The calibration curve of the instrument was 
effected using a commercial enzyme product. For each 
sample, the assay was repeated in triplicate using 250 ng 

25 of partially purified protein. The results are reported 
in Table 1. 

Table 1. p-glucocerebrosidase enzymatic activity 
measured in total extracts of proteins from tobacco seed. 



Tobacco line 


Fluorescence 


WT 


1.4 ± 0.6 


SGI 


26.1 ± 1.5 


SG2 


34.7 ± 1.8 


SG10 


37.0 ± 1.5 


SG3 


21.4 ± 0.9 


SG4 


22.5 ± 1.1 


SG17 


43.9 ± 1.7 
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SG20 


18.8 ± 1.2 


SG22 


21.8 ± 1.4 


SG31 


28.9 ±1.6 


SGB 


19.2 ± 1.0 



Example 7 : 

Quantitative determination of the protein 

The assessment of human GCB accumulation capability 

5 of the various transgenic lines was carried out comparing 
the amount of GCB protein in seed to known amounts of 
commercial GCB. Protein amount was determined after 
electrophoretic separation on SDS-PAGE gel and detection 
with a polyclonal primary antibody, produced in rabbit and 

10 specific for human GCB and with a peroxidase-conjugated 
anti-rabbit IgG secondary antibody. The specific band was 
detected with a scanner, and the protein amount was 
determined comparing the intensity of the band to that of 
bands containing a known amount of protein purified and 

15 present in the same gel at different dilutions. Thus, 
tobacco lines were identified whose protein extract 
produces 8-10 jag GCB per mg of seed-extracted total 
proteins, i.e. corresponding to an amount ranging from the 
0.8% to the 1% of the total proteins of the seed. 

20 Example 8: 

Construction of the vector for the expression of human a- 
galactosidase A in plant seeds. 

The GLA gene coding for human ct-galactosidase A 
was cloned, by RT-PCR technique, from mRNA purified from 

25 placenta, with primers having SEQ ID NO 10 and 11. The 
gene was isolated, in its structural portion also deleted 
of the signal peptide (i.e. deleted of nucleotides 1-116 
of SEQ ID NO 8) and of the poly-A site (not amplified by 
said primers) by amplifying the DNA having SEQ ID NO 8 

30 starting from nucleotide 117 with the same primers, and 
cloned in pGEM®-T (Promega) to form the plasmid named 
pGEM-GLA. The primers designed for the amplification add 
the BamHI restriction site at 5' and the EcoRV 
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restriction site at 3' . The natural gene obtained, which 
at sequence control tested identical to the published one 
(Tsuji S. et al., 1987, Eur. J. Biochem., 165(2), 275- 
280) in the portion encoding the mature peptide, was 
5 cloned in a vector analogous to the one named pPLT2100 of 
example 1 under control of the PGLOB promoter to form the 
plasmid named pPLT4100 (Figure 4) . After accurate control 
by restriction, the resulting pPLT4100 plasmid was used 
for genetic transformation of plants. 

10 Example 9: 

Genetic transformation of plants with pPLT4100 plasmid 

Tobacco plant transformation was carried out using 
the vector named pPLT4100 according to the same 
methodologies used in examples 2 and 3. 

15 The presence of the construct containing the DNA 

coding for GLA was assayed by PCR in plants T 0 (Figure 
ID . 

Example 10: 

Construction of the vector for the expression of 
20 human acid a-glucosidase in plant seeds. 

The GAA gene coding for human acid D-glucosidase 
was cloned deleted of the region encoding the signal 
peptide, by RT-PCR technique, from mRNA purified from 
placenta, with the primers having sequences SEQ ID NO 14 

25 and 15. The gene, in its structural portion deleted of the 
signal peptide as well (i.e. deleted of nucleotides 1-426 
of SEQ ID NO 12) and of the poly-A site (not amplified by 
said primers) was isolated, amplifying the DNA having SEQ 
ID NO 12 starting from nucleotide 427 with the same 

30 primers, and cloned in pGEM®-T (Promega) to form the 
plasmid named pGEM-GLA. The primers designed for the 
amplification add the restriction site EcoRV to 5' and to 
3' . The natural gene obtained, which at sequence control 
tested identical to the published one (Hoefsloot L. H. et 

35 al., 1988, EMBO JOURNAL 7(6), 1697-1704) in the portion 
encoding the mature peptide, was cloned in a vector 
analogous to that named pPLT2100 of example 1 under 
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control of the PGLOB promoter to form the plasmid named 
pPLT4200 (Figure 5). After accurate control by 
restriction, the resulting pPLT4200 plasmid was used for 
genetic transformation of plants. 
5 Comparative example : 

Absence of GCB expression in seed using the 35S promoter 
of the Cauliflower mosaic virus 

In order to assess operation of the 35S promoter of 
the Cauliflower mosaic virus indicated in U.S. Pat. 
0 5,929,304 as promoter suitable for expressing lysosomal 
enzymes, tobacco plants were transformed using a 
recombinant expression vector comprising the sequence SEQ 
ID NO 1 under control of the promoter 35S. The proteins 
were extracted according to standard methods and the 
15 protein product was analysed by Western technique under 
chemiluminescence using a human GCB-specific rabbit 
polyclonal antibody as primary antibody, and a 
peroxidase-conjugated anti-rabbit IgG as secondary 
antibody. The results, reported in Figure 11, demonstrate 
20 that using the 35S promoter for expressing human 
glucocerebrosidase no accumulation of stable protein in 
the seed is obtained. 



